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ABSTRACT 

Several goodness oi fit (QOF) criteria have been 
developed to assist the researcher in interpreting structural 
equation models. However, the determination of GOF for structural 
equation models is not as straightforward as that for other 
statistical approaches in multivariate procedures. The four GOF 
criteria used across the commonly used statistical packages are: (1) 
chi square; (2) GOF; (3) adjusted GOF; and (4) root mean square 
residual. Two other approaches are the Tucker-Lewis Index (1973) and 
the Normed Fit Index (1980). other measures that provide model 
comparisons are the normed chi-square, the parsimonious fit index, 
and the Akaike Information Criterion. A new measure of model fit, the 
maximum internal correlation coefficient, is propose-" as a GOF 
criterion in structural equation models. This approach offers another 
test of model fit. Model testing is suggested for effectiveness and 
functionality of the specified variable relationships. Two tables 
summarize GOF criteria, and there is a 32-item list of references. 
(SLD) 
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INTRODUCTION 

Various statistical approaches have been proposed for 
conducting educational research. These statistical approaches are 
reviewed to provide a background for the development of structural 
equation modeling. Each statistical approach has a criteria by 
which statistical significance is indicated. These criteria or 
significance tests have been given the general name, Goodness of 
Fit (GOF) criteria. 

Multiple regression provided the important multiple 
correlation coefficient (R) and subsequent R-squared value (R 2 ) to 
determine the overall contribution of a set of variables to 
prediction. Multiple regression also permits full and restricted 
models to be tested for the significant contribution of each 
variable in a model. The use of all possible regressions can 
further delineate the best set of multiple independent predictors. 
The importance of multiple regression in explanation and prediction 
as well as the statistical tests have been well established 
(Fedhazur, 1982) . 

Path analysis is essentially multiple-multiple regression 
equations and as such utilizes the R 2 value f t-test, and F-test to 
examine overall model fit, reduction in paths, and to test separate 
partial (path) coefficients (Pedhazur, 1982, 607-614; Williams and 
Klimpel, 1974) . Another GOF criteria called a meaningfulness 
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criteria (As her, 1976, 32-34), examines the discrepancy between the 
original correlations in the matrix and those reproduced from the 
model based on direct, indirect, correlated, and spurious effects 
(Marascuilo and Levin, 1983, 169-172) . The chi-square statistic 
with df equal to the number of over identified restrictions has 
also been widely used (Pedhazur, 1982, 98) . Another GOF criteria 
termed Q (Pedhazur, 1982, 619-622) uses the R 2 value and depending 
upon sample sizes greater than 100 involves an adjustment using W. 
Boyle (1970) previously introduced path analysis capability using 
nominal and ordinal variable paths which was of concern in mixed 
models in multiple regression. 

Factor analytic models are often distinguished between 
exploratory and confirmatory varieties. In either case, the 
expressions of communal it y and uniqueness of a variable are just 
expressions of the general notion of predicted R 2 and residual 
variance (Loehlin, 1987, 20/ Kim and Mueller, 1978a and 1978b; 
Long, 1983) . Factor loadings are equivalent to path coefficients 
and variables or items which share communality on a factor assist 
in describing or labeling latent variables. 

Latent variable models or structural equation models are unique 
in that "factors" are related which distinquishes it from 
relationships and communalaties among observed/measured variables 
(Duncan, 1975; Bentler & Weeks, 1980; Lomax, 1982; Long, 1983; 
Plewis, 1985) . Structural equation models have become widely used 
in the social and behavioral sciences (Saris & Stronkhorst, 1984; 
Anderson, 1987; Fassinger, 1987; Bollen & Ting, 1991) . 
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GOODNESS-OF-FIT CRITERIA 



Several GOF criteria have been developed to assist the 
researcher in interpreting structural equation models. The 
determination of goodness-of-fit for structural equation models 
however is not as straightforward as with other statistical 
approaches in multivariable procedures such as the analysis of 
variance, multiple regression, path analysis, discriminant 
analysis, and canonical analysis. These multivariable methods use 
observed variables which are assumed to be measured without error 
and have statistical tests with known distributions. Structural 
equation modeling procedures have no single statistical test of 
significance which identifies a correct model given the sample 
data. Consequently, the statistical packages have developed a 
number of GOF criteria, unfortunately they do not provide all of 
the same GOF criteria (Table 1) . The four GOF criteria common 
across the statistical packages are: chi-square; goodness-of-fit; 
adjusted goodness-of-fit; and root mean square residual (except EQS 
program) . 



Insert Table 1 Here 



GOODNESS-OF-FIT CRITERIA TYPES 
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The various GOF criteria are typically used in combination to 
assess model fit , model comparison , and model parsimony (Hair/ et 
al. r 1992 r 489-496). All of the GOF criteria, except X 2 , do not 
have an associated statistical test of significance. Therefore, 
most GOF criteria range in value from 0 (no fit) to 1 (perfect fit) 
(Bentler, 1980; Baldwin, 1989) . Table 2 summarizes many of the GOF 
criteria with an associated level of acceptable fit and 
interpretation . 



Insert Table 2 Here 



MODEL FIT 

Model fit determines the degree to which the structural 
equation model fits the data. Model fit GOF criteria commonly used 
are: chi-square (X 2 ), goodness-of-fit index (GFI) , adjusted 
goodness-of-f it index (AGFX) , and root mean square residual (RMR) 
(Joreskog and Sorbom, 1989, 25) . These GOF criteria are based on 
differences between the observed (original, S) and model implied 
(reproduced, I) correlation/covariance matrix. 
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Chi -square (X 2 ) 



A large X 2 value relative to the degrees of freedom indicates 
that the observed and reproduced (estimated) matrices differ. 
Statistical significance indicates the probability that this 
difference is due to sampling variation. A low X 2 value with 
significance levels greater than .05 indicate that vhe two matrices 
are not statistically different. The researcher is interested in 
obtaining a low X 2 value with a non-signif icant probability ltivel. 
The X 2 test being non-significant indicates that the model fits the 
data, however, an uncertainty will always persist because other 
models are possible that may fit the data. Although the X 2 GOF 
criteria is the only statistical test procedure, it is sensitive to 
sample size because as sample size increases (generally above 200) , 
the X 2 test has a tendency to indicate a significant difference 
between equal models. In contrast, as sample size decreases 
(generally below 100) the X 2 test indicates non-signif icant 
differences between the observed and reproduced matrices in unequal 
models. The X 2 test is also sensitive to departures from 
multivariate normality of the observed variables. 

Three approaches are commonly used to calculate X 2 in latent 
variable models (Loehlin, 1987, 54-67) . They are maximum 
likelihood (ML) , generalized least squares (GLS) , and ordinary 
least squares (OLS) . Each approach estimates a best fitting 
solution and evaluates the model fit. The ML estimates are 
consistent, unbiased, efficient, scale invariant, scale free, and 



normally distributed if the observed variables meet the 
multivariate normality assumption. The GLS estimates have the same 
properties of the MLS approach under a less stringent multivariate 
normality assumption and provide an approximate chi-square test of 
model fit to the data. The ULS estimates do not depend on a 
normality distribution assumption, however, the estimates are not 
as efficient nor are they scale invariant or scale free. The ML X 2 
statistic is: X 2 - (n-1) F^; GLS X 2 statistic is: X 2 = (n-1) F gl .; 
and ULS X 2 statistic is: X 2 = (n-1) F ul .. Where: 

F^ = tr ( S T 1 ) - n + In |I| - In |Si / 
F ?lf = .5 tr [ ( S - Z ) S' 1 ] J ; 
F uA . - .5 tr [ ( S - Z ) 2 ] ; 
df = .5 (p+q) (p+q+1) - t ; 
t « total number of independent parameters estimated / 
n = number of observed variables ; and 
(p + q) = number of observed variables analyzed. 

Goodness-of-Fit (GFI) and Adjusted Goodness-of-Fit (AfiFI) Indices 



The GFI index is based on a ratio of the sum of the squared 
differences between the observed and reproduced matrices to the 
observed variances, thus allowing for scale. The GFI measures the 
amount of variance and covariance in S that is predicted by the 
reproduced matrix £. The GFI index can be computed for ML, GLS, or 
ULS estimates (Bollen, 1989, 276-277) . The GFI index for ULS 
estimates would be computed as: 



tr t(S - I) 2 )] 
GFI = 1 - 

tr (S 2 ) 

The AGFI adjusts the GFI index for the degrees of freedom of a 
model relative to the number of variables. The AGFI index is 
computed as: 

AGFI - 1 - (k/df) (1 - GFI) 
Where: 

k ■ number of variables in matrix I 
df = number of degrees of freedom in model 
The GFI and AGFI indices can be used to compare the fit of two 
different models with the same data or compare the fit of models 
with different data. 

Root Mean Square Residual (RMR) 

The RMR index uses the square root of the mean squared 
differences between matrix elements in S and I. It is used to 
compare the fit of two different models with the same data. The 
RMR index is computed as: 

RMR = [ (1/k) I i3 { e AJ - o Li ) 2 ] 1/2 

MODEL COMPARISON 

Given the role chi- square has in model fit of latent variable 
models, two others have emerged as variants for model comparison: 
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Tucker-Lewis Index (TLX) and Normed Fit Index (NFI) (Bentler & 
Bonett, 1980, 1982; Loehlin, 1987, 68) . These GOF criteria 
typically compare a proposed model to a null model. In the EQS 
program, the null model is indicated by the Independence Model Chi- 
square value. The null model could also be any model that 
establishes a base for expecting other models to exceed. 

Tucker-Lewis Index (TLI) 

Tucker and Lewis (1973) developed this index initially for 
factor analysis, but later extended it to structural equation 
modeling. The measure can be used to compare alternative models or 
a proposed model against a null model. It is computed using the X a 
statistic as follows: 

( X null / df m»ll) ~ ( X ' 
TLI " 

<X 2 BUll / df BnU ) - 1 

Normed Fit Index (NFI) 

The NFI is a measure which rescales chi -square into a 0 (no 
fit) to 1.0 (perfect fit) range (Bentler & Bonett f 1980). It is 
used to compare a proposed model to a null model as follows: 

NFI = 

A null 
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MODEL PARSIMONY 

Parsimony refers to the number of estimated coefficients 
required to achieve a specific level of fit. Basically, an "over 
identified" model is compared to a "restricted" model. The AGFI 
measure discussed previously will also provide an index of model 
parsimony. Others which provide model comparison are: Nonrad Chi- 
square (NC) ; Parsimonious Fit Index (PFI) / and Akaike Information 
Criterion (AIC) . These indices indicate parsimonious goodness of 
fit, taking into account the number of parameters required to 
achieve a given level of chi- square. 

Normed Chi-square (NC) 

Joreskog, K.G. (1969) proposed that X 2 be adjusted by the 
degrees of freedom to assess model fit. The NC measure can 
identify inappropriate models in two ways: (1) a model which is 
"over- identified" and capitalizes on chance; or (2) models that do 
not fit the observed data and need improvement. The NC measure, 
like many others is affected by sample size. It is calculated as: 
X 2 

X 2 = 

df 

Parsimonious Fit Index (PFI) 

The PFI measure is a modification of the NFI measure (James, 
Muliak, & Brett, 1982) . The PFI however takes into account the 

• 11 

ERIC L A 



number of degrees of freedom used to obtain a given level of fit. 
Parsimony is achieved with a high degree of fit for fewer degrees 
of freedom in specifying coefficients to be estimated. The FFX is 
used to compare models with different degrees of freedom and is 
calculated as: 

PFI - ( df picowd /df null ) x NFI 

Akaike Information Criterion (AIC) 

The AIC measure is used to compare models with differing number 
of constructs similar to the PFI (Akaike, 1987) . The AIC measure 
will always be negative, but values close to zero indicate a more 
parsimonious model. It indicates both model fit (S and I elements 
similar) and a model not "over- identified" (parsimony) . The AIC 
measure is calculated as: 

- X 2 
AIC = 

2 

Muliak et al. (198 9) evaluated the X 2 , NFI, GFI, AGFI, and AIC 
goodness-of-f it indices. They concluded that these indices fail to 
assess parsimony and are insensitive to misspecif ication of causal 
relationships. Moreover, they recommend an approach that assesses 
relative fit of the structural equation model among latent 
variables (model fit) independent of assessing the fit of the 
hypothesized relations of indicator variables to latent variables 
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(measurement model) . Their rationale is that with few latent 
variables, most parameter estimates involve relations among 
observed indicator variables of latent variables and therefore 
measurement model estimates rather than latent variable 
relationship estimates determine a greater proportion of the 
covariance structure explained. They propose the following 
adjustment to separately estimate the effects of the latent 
variable estimates from the measurement model estimates: 

RNFIj = ( F u - F j ) / [ F u - F m - ( d., - d. ) ] 

Where: 

F a = X 2 of full model 

Fj = X 2 of structural equation model (latent variable model) 
F n = X 2 of confirmatory factor model (measurement model) 
d j ■ degrees of freedom for structural equation model 
d,,, * degrees of freedom for measurement model 

A corresponding relative parsimony ratio is given by: 

RP J -{d j -d m ]/[cl tt -d 1B ] 

Where: 

d } = degrees of freedom for structural equation model 
d,,, = degrees of freedom for measurement model 
d,, = degrees of freedom for null model 
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To compare different models for fit, multiply RPj x RNFI j to obtain 
a relative parsimonious fit index appropriate for assessing how 
well and to what degree the models explain relationships both in 
indicator variable measurement of latent variables and among the 
the latent variables. 

NEW MEASURE OF MODEL FIT 

Recent developments have offered another approach to assessing 
model fit (Joe and Mendoza, 1989a) . The maximum internal 
correlation coefficient has been recommended for further 
development in structural equation models as a GOF criteria (Joe 
and Mendoza, 1989b, 243) . It could be used to test the equality of 
pairs of eigenvalues from a correlation matrix or test the null 
hypothesis of independence in a set of variables. It is also 
useful in resolving "heywcod cases" and multicollinearity problems 
(identifies which variables are causing the problem) . There are 
several potential problems however in that several internal 
correlations are possible (pairwise combinations of eigenvalues of 
a correlation matrix) and the distribution of the statistic is 
unknown. Venables (1980) however has presented the distribution of 
the internal correlation and a test of significance which 
approximates an F distribution under the special case of a 2 x 2 
covariance matrix. The maximum internal correlation is calculated 
as : 
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Where: 

X x - largest eigenvalue in the population correlation matrix 
Aj, = smallest eigenvalue in the population correlation matrix 

The author proposes that the application of the internal 
correlation coefficient to structural equation modeling would take 
the form of: 

Where: 

A-s <* largest eigenvalue in original correlation matrix 
A z « largest eigenvalue in reproduced correlation matrix 

This would provide an overall measure of the comparison of equality 
in the elements of the original and reproduced correlation 
/covariance matrices or between two proposed model correlation 
matrices . 

SUMMARY 

Structural equation modeling requires the use of various GOF 
criteria in combination to interpret a model. Additionally, 
appropriate sample size, standardization of variables, and 
normality determinations need to be considered. Appropriate sample 
size can be determined using the Critical N (CN) statistic 
(Hoelter, 1983) which is: CN = (X 2 / F) + 1. CN gives the sample 
size at which the F value would lead to a rejection of H 0 : S = 2 at 
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a specified alpha level. The CN statistic is output by the 
SAS/CALIS progrun and indicates the recommended sample size. 
Standardization of variables occurs automatically when inputing a 
correlation matrix without mean and standard deviation values 
specified. This however doesn't occur when inputing a covariance 
matrix therefore caution is advised in selecting and interpreting 
GOF criteria because the variables may not be on the same scale of 
measurement. Normality of variables should further be examined 
prior to inputing a correlation or covariance matrix in structural 
equation models. As noted previously, several GOF criteria have 
evolved as adjustments to the X 2 and subjective interpretation has 
been made easier with scaling from 0 to 1. However, GOF criteria 
must be interpreted in view of sample size, standardization of 
variables, and the normality assumption. 

The new GOF criteria, internal correlation coefficient, holds 
the possibility of yet another statistical test of model fit once 
the distributional properties are better known. However, model fit 
is of itself a subjective approach since no single "correct" model 
is being determined. Other models may be equally plausible given 
the sample data. What then is the solution to this dilemna? The 
author suggests model testing for effectiveness and functionality 
of the specified variable relationships. This implies that the 
variables are chosen such that they can be manipulated to examine 
(test) the effect upon other variables in the model. Model testing 
takes the researcher to the next step beyond determining whether 
the model "fits" the sample data. 

IB 
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Table 1: Goodness-of-fit Criteria in Statistical Packages 



Package /Program 



GOF Criteria 



SAS / CALIS 



Chi -square 

Goodness-of-Fit Index 

Adjusted Goodness-of-Fit 

Root Mean Square Residual 
♦Bentler Comparative Fit Index 
♦Normal Theory Reweighted LS Chi-square 

Akaikes Information Criterion 
♦Consistent Information Criterion 
*Schwarz Bayesian Criterion 
♦McDonald Centrality 

♦Bentler & Bonett Non-normed Fit Index 
Bentler & Bonett Normed Fit Index 
James, Muliak, & Brett Parsimonious Fit Index 

♦Wilson & Hilferty Z-test 

♦Bollen Normed Index Rhol 

♦Bollen Non-normed Index Delta2 
Hoelter Critical N 



SPSS / LISREL 7 



BMDP / EQS 



Scientific 
Software/ LISREL 7 



Chi-square 

Goodness-of-Fit Index 
Adjusted Goodness-of-Fit Index 
Root Mean Square Residual 

Chi-square 

Goodness-of-Fit Index 
Adjusted Goodness-of-Fit Index 
Akaikes Information Criterion 

*Bozdogan Consistent AIC 
Normal Theory Reweighted LS Chi-square 

♦Bentler & Bonett Non-normed Fit Index 
Bentler & Bonett Normed Fit Index 

*Bentler Comparative Fit Index 



Chi-square 

Goodness-of-Fit Index 
Adjusted Goodness-of-Fit Index 
Root Mean Square Residual 



Note: The following require individual calculation: 
Tucker-Lewis 
Normed Chi-square 
Internal Correlation 

These indices are not covered in the manuscript, consult the 
respective statistical user guide for a reference. 
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Table 2; GOF criteria, acceptable fit levels and interpretation. 
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GOF Criteria 



Acceptable Level 



Interpretation 



Chi -square 



Tabled X 2 value 



Compare obtained X 2 value 
to tabled value for given 
degrees of freedom. ML and 
GLS estimates preferred for 
covariance matrix and ULS for 
correlation matrix. 



Goodness-of-Fit 
(GFI) 

Adjusted GFI 



Root Mean Square 
(RMR) 



0 (no fit) to 

1 (perfect fit) 

0 (no fit) to 

1 (perfect fit) 

Researcher 
defines level 



Value close to .9 reflects a 
good fit. 

Value adjusced for df with .9 
reflecting a good model fit. 

Relationship to observed S 
residual variance/ covariance 



Tucker-Lewis 



Normed Fit Index 



0 (no fit) to 

1 (perfect fit) 

0 (no fit) to 

1 (perfect fit) 



Value close to .9 reflects a 
good model. 

Value close to . 9 reflects a 
good model. 



Normed Chi-square 1.0 to 5.0 



Parsimonious Fit 
Index 

Akaike 

Information 

Criterion 



0 (no fit) to 

1 (perfect fit) 

0 (perfect fit) 

- (poor fit) 



Less than 1.0 is a poor model 
fit. More than 5.0 reflects 
need for improvement. 
Differences of .06 to .09 
when comparing models. 

Compare alternative models. 



Compare alternative models. 



Internal 
Correlation 



0 (S » E> to 

1 (S * I) 



Model Fit between S and 
reproduced Z. 



>0 



